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Abstract —We study opportunistic scheduling and the sum ca¬ 
pacity of cellular networks with a full-duplex multi-antenna hase 
station and a large number of single-antenna half-duplex users. 
Simultaneous uplink and downlink over the same band results 
in uplink-to-downlink interference, degrading performance. We 
present a simple opportunistic joint uplink-downlink scheduling 
algorithm that exploits multiuser diversity and treats interference 
as noise. We show that in homogeneous networks, our algorithm 
achieves the same sum capacity as what would have been achieved 
if there was no uplink-to-downlink interference, asymptotically in 
the number of users. The algorithm does not require interference 
CSI at the base station or uplink users. It is also shown that 
for a simple class of heterogeneous networks without sufficient 
channel diversity, it is not possible to achieve the corresponding 
interference-free system capacity. We discuss the potential for 
using device-to-device side-channels to overcome this limitation 
in heterogeneous networks. 

Index Terms —Full-duplex networks, opportunistic scheduling, 
multiuser diversity 

1. Introduction 

Full-duplex wireless communication is becoming closer to 
reality, in light of recent experimental results demonstrating 
its feasibility Q, 0. Especially the development of massive 
MIMO can create opportunities for full-duplex communica¬ 
tion, since all implementations of full-duplex use multiple 
antennas. The hrst application of full-duplex in a practical 
system is expected to be in base stations instead of mobile 
devices, due to relative flexibility in design. Since mobile 
devices remain half-duplex, the uplink-downlink nature of a 
cellular system is retained, even when the base station is full- 
duplex. By serving uplink and downlink simultaneously over 
the same band, a full-duplex cellular system might have the 
potential to double the spectral efficiency. However, in order 
to realize this gain, one is immediately faced with a challenge 
that is not present in half-duplex systems: uplink-to-downlink 
interference. 

The problem of uplink-to-downlink interference manage¬ 
ment in full-duplex systems has been considered in 0 
and 0 with several interference management strategies pro¬ 
posed, based on interference alignment or message splitting. 
However, in a large network, such sophisticated interference 
management strategies can be impractical, may require tight 
coordination between nodes and a large amount of device-to- 
device CSI feedback. 
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In this paper we explore schemes that require much less 
CSI overhead by proposing a combination of opportunistic 
beamforming along with treating uplink-to-downlink interfer¬ 
ence as noise. This enables us to design an opportunistic joint 
uplink-downlink scheduling algorithm that, in a homogeneous 
network with a large number of half-duplex users and a multi¬ 
antenna full-duplex base station, asymptotically achieves the 
sum of the capacities of the isolated uplink and downlink 
systems, thus doubling the spectral efficiency. The main idea 
underlying the result is to apply random transmit and receive 
beamforming at the base station 0, and exploit the multiuser 
diversity in the system to schedule the uplink and downlink 
users that conflict the least with each other. 

Many authors (including 0^ 0^ among others) have stud¬ 
ied the problem of MIMO downlink scheduling in the many- 
user regime, and it has been demonstrated that the same 
scaling law as the optimal dirty-paper coding sum rate can 
be achieved via beamforming with scheduling. It was also 
shown that the gap between the sum rate achievable with 
beamforming with scheduling and dirty-paper coding goes to 
zero 0, 0. There has also been works that explore how 
to exploit multiuser diversity in the presence of interference, 
under multi-cell downlink pO) , and spectrum sharing cognitive 
radio in) scenarios. However, the schemes developed in these 
works are either intended for an isolated downlink system, or 
fail to provide any theoretical performance guarantees on the 
overall system throughput when translated into a full-duplex 
system, where the goal is to simultaneously extract uplink 
and downlink multiuser diversity gains while dealing with 
the uplink-to-downlink interference. Based on the existing 
literature on opportunistic scheduling, it is not clear whether 
downlink sum rate optimality through scheduling is maintained 
in the presence of uplink interference, especially when the 
uplink sum rate optimality is also sought. 

We have two main contributions in this work. First, we 
show that the asymptotic sum rate optimality in both uplink 
and downlink can be maintained individually, even in the 
presence of uplink-to-downlink interference. To achieve this, 
we develop a simple opportunistic scheduling algorithm based 
on random beamforming. The algorithm does not require the 
base station or the uplink users to have channel information 
about the interference links. Moreover, very little CSI is 
required at the base station due to random beamforming. We 
also show that the spatial multiplexing gain offered by the 
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Fig. 1. A cellular system with a full-duplex base station with M = 2 
antennas and n = 2 uplink and downlink half-duplex users. Uplink users are 
represented with white dots, downlink users are represented with black dots, 
and the interference links are represented with dashed lines. 

multiple antennas is retained in the full-duplex system when 
the number of antennas scale logarithmically with the number 
of users, as was shown for isolated downlink in ©■ 

This asymptotic decoupling result relies on there being 
sufficient channel diversity in the network. Although a ho¬ 
mogeneous network with i.i.d. fading links provides sufficient 
diversity for this purpose, such diversity may not be present 
in a real network. For instance, there might be areas in a cell 
where users are densely clustered, and some other areas that 
are mostly deserted, resulting in a lack of sufficiently rich 
channel conditions. In a full-duplex system, in addition to 
diversity in channels to and from the base station, diversity in 
interference links is also required to realize the multiuser diver¬ 
sity gains. Our second contribution is to show that for a simple 
class of heterogeneous networks, it is not possible to achieve 
such gains, by deriving an upper bound on the achievable sum 
rate. In particular, the gap between the achievable sum rates 
of the full-duplex system and the decoupled system grows 
linearly with the number of antennas and logarithmically with 
downlink SNR. Although our heterogeneous network model is 
rather simple, it features the key property of the lack of channel 
diversity. To address this limitation in heterogeneous networks, 
we demonstrate through an example that establishing device- 
to-device cooperation over orthogonal side-channels can be 
effective. 

II. Model AND Notation 

We consider a cellular system with a single full-duplex base 
station, equipped with M antennas for uplink and M antennas 
for downlink communication (see Figure 1). We assume there 
are n uplink, and n downlink half-duplex users, each with a 
single antenna, requesting communication over the same band. 
We assume the base station is able to completely cancel self¬ 
interference, but the uplink transmission interferes with the 
received signal at the downlink users. 

We first consider a homogeneous network, where all links in 
the network, including the interference links, are are generated 
i.i.d. from a CA/^(0,1) distribution; but once drawn, they 
remain hxed throughout the duration of transmission. 

The uplink channel is described by the equation 

y = iinX + z, 


where y € vector of channel outputs at the 

base station, x G is the vector of channel inputs from 

n uplink users, subject to a per-user block power constraint 
< P for a block length of T, for k = 

H„=[hi ... K]g 

is the matrix of channel gains, with each element generated 
i.i.d. according to CA/'(0,1), and z ~ CA/'(0 ,Im) is the 
vector of complex Gaussian noise. Throughout the paper, 
we use the bar notation whenever a variable pertains to the 
uplink transmission, whereas we use plain letters for variables 
pertaining to downlink transmission, including the uplink-to- 
downlink interference link gain. 

The downlink of the system is described by 

y = H*x -f GnX + z, 

where y G is the vector of channel output at the n 

downlink users, x G is the vector of channel inputs 

from M antennas, subject to a total block power constraint 
^ < P, Hn G is the matrix of channel 

gains and the matrix of interference link gains, 

with each element of the matrices generated i.i.d. according 
to CA/'(0,1), and z ~ CA/'(0,I„) is the vector of complex 
Gaussian noise. 

The set of all link gains in the network is denoted by Tin = 
{Hn,Hm Gn)- Further, the rate of ith downlink (uplink) user 
is denoted by {Rn) (Pi (Rn)), and the sum uplink and 
downlink rates are denoted by 

n n 

R-n (Rn) ='^Pi (P-n) , R-ji (Rn) = '^ Pi (Pn) ■ 

i=l i=l 

All logarithms throughout the paper are assumed to be in base 
e. We also dehne 

[n] := {/c £ N : 1 < fc < n} . 

We impose the constraint that at most M uplink users can 
simultaneously transmit to the base station, i.e., the vector x 
can only have M non-zero elements per time sloj^ 

III. Sum Capacity in Homogeneous Networks 
A. Opportunistic Scheduling 

We consider an opportunistic scheduling algorithm that 
performs random beamforming |j^ independently for uplink 
and downlink, and schedules the users whose channels best 
ht to the current beamforming patterns, and least interfere 
with each other. In particular, the base station hrst constructs a 
random unitary matrix $ and multiplies this with the received 
uplink channel output 

-f 

Note that since $ is unitary, is still distributed as 

CA/^(0,Im)- We consider the scheduling of M uplink users 

* This constraint is placed to prevent total uplink power in the system from 
growing unboundedly. 




for transmission at a given time. In particular, each element 
of the vector ^*y is assigned to a user, and the signal of that 
user is decoded from this component of the effective channel 
output, treating inter-stream interference as nois^ Note that 
this can be viewed as choosing an MxM submatrix of 
We use the following rule to choose the user Um & {1, ■ • ■, 
assigned to the mth stream; 

Um = arg min U^hkl'^ 

where 

<5m = {1 < fc < n : < en,Vr ^ m} 

for some e„ such that e„ —>■ 0 as n —>■ (xQ where is 
the TOth column of $. Note that this scheduling algorithm 
first determines a set of candidate users for stream m, by 
eliminating all users whose interference to any other stream 
exceeds a certain threshold, and then picks the user whose 
channel has the largest projection along the mth beamforming 
vector in the candidate set. We denote the set of uplink users 
scheduled in this way as T = {Um}m=i- 

Next, we consider the scheduling of downlink users, based 
on the uplink user selection. As in the uplink case, we begin 
by generating a random beamforming matrix $, and precode 
the transmitted signal with it, so that the vector of received 
signals at the n downlink users becomes 

y = + GnX + z, 

We use the following rule to choose the user Um G {1, ■ • •, 
assigned to the mth stream: 

Um = arg min \(j>mhk\^ 
k£Sm 

where 

Sm = {^ < k < n : {(pmkkf < e-n^r ^ m; 

\9kj\'^ < en,Vj e T} 

for the same e„ sequence as in the downlink, where (pm is the 
mth column of $, i.e., the candidate set of users for stream 
m are the users who receive bounded uplink interference as 
well as bounded inter-stream interference. We denote the set 
of uplink users scheduled in this way as T = {Um}m=i- 
Remark 3.1: Originally, random beamforming was consid¬ 
ered for downlink communication in order to artifically induce 
channel variations and realize the multiuser diversity effect 
0 . However, in a full-duplex system, one also needs to 
induce variations in the level of interference to each user to 
extract this gain. Since each user has a single antenna, this 
is not possible through random beamforming at the uplink 
user side. However, one can still perform receive beamforming 

^Although successive cancellation decoding can also be used, this does not 
improve our main result, hence we treat interference as noise for simplicity. 

^Note that e„ must be scaled down slow enough to ensure that |Sm| > 0 
with high probability. The exact scaling of is left unspecified here, but in 
the proof of our main result, it will be seen that 6^ = 0^ kjJTi) ^ good 
choice. 


for uplink at the base station, which results in scheduling a 
different subset of users at each time slot, which in turn causes 
variations in the aggregate interference strength observed at 
each downlink user, as desired. 

Remark 3.2: Note that the base station or the uplink users 
do not require the channel knowledge of the interfering links 
for this scheme to work. If the downlink users are able to 
track the uplink interference strength they receive (which can 
potentially be arranged by overhearing the uplink pilots), they 
can send SNR feedback for their own channels only if the 
current interference level is below the threshold, and the base 
station can perform scheduling based only on this information. 


B. Asymptotic Sum Capacity for Fixed Number of Antennas 
Define the achieved uplink and downlink gaps from indi¬ 
vidual uplink and downlink capacities as 

77 {Hn) := {Un) - {Un) 

77 (H„) := Cf (H„)-R„ {Hrf) 

respectively, where (Rn) is the sum capacity of the 

multi-antenna MAC formed by considering the isolated uplink 
system, subject to the constraint that only M users can transmit 
simultaneously, and {Hn) is the sum capacity of the 
multi-antenna broadcast channel formed by isolating downlink 
system, achieved by dirty-paper coding GD- 

Clearly, ("Hnj-I-C®*- {Hn) is an upper bound on the 

sum rate R„ {Hn) + R-n {Hn) achievable in the full-duplex 
system. Our main result is that in a homogeneous network, 
this upper bound is asymptotically achievable as the number 
of users n goes to infinity. This is more precisely stated in the 
following theorem. 

Theorem 3.1: For any <5 > 0, 


lim P (77 {Hn) + 9 {Hn) > (j) = 0 

n—^oo 

Proof: See Appendix [A| ■ 

Theorem |3.1| implies that for a homogeneous network with 
sufficiently many users, the uplink-to-downlink interference 
can be mitigated through proper user scheduling to the extent 
that the uplink and downlink systems gets asymptotically 
decoupled. The main idea underlying this result is to exploit 
multiuser diversity, in terms of both the richness in the channel 
vectors to and from the base station, and richness in the 
strength of the interfering link. 

Another important point in Theorem 3.1 is that not only 
does the sum rate has the same scaling law as the decoupled 
system (which scales as M log log n for both uplink and 
downlink, as in the isolated uplink and downlink systems 
1 ^), but the additive gap between the decoupled system 
sum capacity and the achievable full-duplex sum rate goes 
to zero. A similar behavior has been observed before for 
MIMO broadcast channels, where it has been shown that the 
achievable rate difference between zero-forcing beamforming 
and dirty paper coding goes to zero as ti —00 0 - Our result 
shows that through random beamforming, the same result can 
be obtained for simultaneous uplink and downlink, in the 
presence of uplink-to-downlink interference. 



C. Scaling the Number of Antennas 


An important assumption in Theorem 3.1 is that the number 
of antennas remain fixed as n grows. This is a crucial 
assumption, since as M grows, one would need to schedule 
a growing number of users simultaneously in order to realize 
the full multiplexing gain of the system, which would result 
in increasing uplink-to-downlink and inter-stream interference. 
Hence, an important question is whether a similar result would 
hold in the case where M is scaling. In |j^, it is shown that 
for an isolated downlink system, the spatial multiplexing gain 
can be preserved if M is scaled like O (logn). Here, we show 
a similar result for the full-duplex system, which is given in 
the following theorem. 

Theorem 3.2: 


If lim„_>oo = a for some a > 0, 


R„ (Tin) + Rn (’H„) 

hm -——- = p 

n—j-oo 2Af 

for some ^ > 0, almost surely. 

Proof: See Appendix [B] ■ 

Hence, even when the number of antennas grows to support 
the large number of users, the full sum degrees of freedom of 
the system can still be fully utilized despite the growing level 
of uplink interference, provided that the number of antennas 
does not scale faster than logarithmically in n. 


IV. Sum Capacity in Heterogeneous Networks 



Fig. 2. A heterogeneous cellular system with a full-duplex base station with 
M = 2 clusters and n = 2 uplink and downlink half-duplex users. The uplink 
and downlink users within the same cluster have the same channel, and users 
in different clusters do not interfere with each other. 


the cluster index of user i, then the interference link gain 
magnitude from user j to user i is given by 


f g, if n{i) = K{j) 
f 0, otherwise 


As in the homogeneous case, we impose the constraint that at 
most M uplink users can transmit simultaneously. Although 
this is a very simplified model, the unusual way in which the 
multi-antenna MAC and the BC interact with each other still 
makes this a non-trivial problem. 

Henceforth, this model will be referred to as a {M,h,g)- 
clustered network. Next, we derive an upper bound on the sum 
capacity of the network. 


The main idea underlying the result in Theorem 3.1 


was to 

exploit the channel richness in the network to asymptotically 
decouple the uplink and downlink transmissions. We have 
seen that the homogeneous model described in Section [n| 
provides sufficient richness for this purpose. However, such 
homogeneity may not present in an actual network. Instead, 
users may be densely clustered in certain areas, and sparsely 
located in others. In such a scenario, it may not be possible to 
simultaneously approach uplink and downlink sum capacities, 
since the lack of channel diversity might force one to schedule 
an uplink-downlink user pair with significant interference in 
between. 

In order to study this opposite regime, we consider a specific 
class of clustered networks that takes such non-homogeneity 
to the extreme, and prove that it is not possible to achieve 
the sum capacity of the decoupled system in such networks. 
Although the model of networks that we consider is rather 
specific, the main insight derived from this model might apply 
to more general heterogeneous networks. 


A. Heterogeneous Model 

We consider a network with M clusters (see Figure 2), 
hosting a total of n uplink and n downlink users that are 
uniformly distributed among them. We consider a simplified 
model where each cluster is assigned a spatial direction hi, 
with hihj — 0 for i j, and ||/ii|| = h for all 1 < i < M. 
We assume that all users (both uplink and downlink) within 
a cluster has the identical channel vector hi. Further, we 
assume an all-or-none interference model, i.e., if K{i) denotes 


B. Sum Rate Upper Bound and the Gap from the Decoupled 
System Capacity 

Theorem 4.1: If (R„,R„) is an achievable rate pair in a 
(M, h, g) —clustered network, then 


R„ -f R„ < M log 



h^P \ 

i + g^p) 


-f Mlog 



T g^ P T 2^/1 



Proof: Let and denote the vector of channel 
outputs and the vector of noise at the users in cluster m. 
Since the downlink users do not cooperate, the capacity does 
not depend on the covariance matrix of the noise at the 
downlink, as long as > 0 and the diagonal consists of 
I’s m- Hence, we assume that within the same cluster, all 
downlink users are subject to the same noise process, i.e., 
^ CA/^(0,11^), where 1 is the all ones vector. The 
noise processes at different clusters are independenj^ Under 
these assumptions, using a genie-aided argument we show in 
Appendix that for a block length of N, 


N (Rn T Rn) < max max 


N M 


EE log ( 1 + h^Pm.t + km.tg"^P + 2p/iy km,tPm,tP 


t—\ m—1 


^For a general broadcast channel, it is known that assuming independent 
noise processes gives a loose bound while using Sato upper bound |14| ; 
however, this does not matter in this case, since the links are orthogonal. 









+ log 



km,th‘^P \ 

1 + km,tg‘^P ) ’ 


( 1 ) 


where is the number of uplink users scheduled from 
cluster m at time t, Pm,t is the power allocated to mth 
channel at the base station at time t, and Af := [M] x [N]. 
It can be verified that the log terms in Q are concave and 
monotonically increasing in {Pm,t, km,t), and hence the result 
follows by Jensen’s inequality. ■ 

It is easy to see that the sum of the isolated uplink and 
downlink capacities for a (M, h, 5 )-clustered network is given 
by 


CMAC-M + ^BC 


1 + h^ 



+ Mlog{l+ h^P) 

( 2 ) 


Define the gaps form isolated systems, 77 and 77 as in the 
homogeneous case. Also set SNR := h'^^, SNR“ := q '^P, 


4.1 


SNR^ := h'^P. The following corollary of Theorem 
characterizes the scaling behavior of 77 + 77 . 

Corollary 4.1: For a (M, h, p)-clustered network with num¬ 
ber of users n > M, 


lim I ' I ^ ^ 
snr-!-oo M log SNR “ 

Proof: See Appendix [P] ■ 

Note that this is the gap between an upper bound on the sum 
capacity and the decoupled system capacity. Hence, regardless 
of the scheme applied, the achieved sum rate can get arbitrarily 
far from the decoupled system sum capacity. 


C. Potential for Cooperation over Side-Channels 

In order to remedy this inherent limitation in heteroge¬ 
neous networks, we propose the use of device-to-device side- 
channels for user cooperation to resolve the full-duplex in¬ 
terference. In particular, we consider a system architecture 
where each uplink user is capable of allocating some A S [0,1] 
fraction of its power to an orthogonal channel that is used for 
cooperation with the downlink users. The side-channels are 
modeled by 


Vi = gxj + 


with the power constraint E 




< \P, for each uplink 


user j and downlink user i such that nii) = n{j), with 
Zi ~ CAf(O.l). Hence, the side-channels can be considered 
as orthogonal broadcast channels for each uplink user (we 
assume each broadcast channel operates over a different band, 
hence they do not interfere). 

It is easy to see that cooperation over such orthogonal side- 
channels can help mitigate the device-to-device interference. 
Some schemes have been proposed in 0 regarding how 
to use such side-channels. Here, we focus on the following 
very simple scheme as an example to demonstrate that side- 
channels can indeed be effective in mitigating full-duplex 
interference in clustered networks. 

Each uplink user j replicates its symbol over the main 
channel on the side-channel, with equal power allocation, i.e.. 


Xj = Xj, and A = j. Each downlink user i subtracts the output 
received over the side-channel yt from its output in the main 
channel yi to obtain 

yi-yi = hix 4-z,-Zi 

Note that as a result, the effective channels of each uplink 
and downlink gets isolated, but the signal-to-noise ratio gets 
halved for both uplink and downlink due to power allocation 
and noise superposition, respectively. Therefore, this scheme 
can achieve 

R„ -f R„ < Mlog ^1 -f + Mlog -f 

which is easily seen to be within 2M bits of the isolated 
system capacity with the side-channels (since the side-channel 
cannot increase capacity in the isolated case 0 X independent 
of SNR. 
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Appendix A 
Proof of Theorem I3.1I 

Assume the M streams are decoded in the order (1,..., M) 
for both uplink and downlink, and denote the rate achieved on 
the TOth uplink (downlink) stream by with 

Y.m=i “‘I = R-«- Note that these 

rates are all random variables due to their dependence on "Hn, 
$ and $, but in this proof we will suppress this dependence 
for brevity. 

Define p„ := P (A: S Sm) and := P (fc S Sm) for an 
arbitrary user 1 <k <n and arbitrary 1 < m < M. Note that 
PmPn —>■ 0 as n —> 00 . 

Define ^ for a large constant c > 0, define := 

Pn — S'n and Qn := pn — S'n, and the events 


*P'm •— {I I ^ } Qm ■— ^ mUX | | > Cji ^ 


J'm •— ^ Qm ■— ^ ^ 


Let US choose tn = O 


V log n y 


Then 


PP ( 8\ ( S 

P(p + ?7><5) <Pf?7>2)+'P(^>2 


(b) ^ ^ 
< ^P 

m —1 

M 

+ E 


^ T}MAC-M t> ^ 

-R„ -R„ > — 


JLrBC _ -Dim) ^ ^ 

M ” ” 2M 


MP i - R^ 


> 


2M 


+ Mf (^Rf - Rl"*) > E 
\M ” ” 2M 

< MP ( > E 

V ^ ” 2M 

+ MF{g^^\Tra)+MF{f^) 


1 


MP( —r®'^-r: 


,(m) 


M*"" *'■” ^ 2M 

■MFig:^\Trr.) + MF{T^J 


^m 5 ^r) 


(3) 

(4) 

(5) 

( 6 ) 


where (a) and (b) follow by the fact that > 

X => Vfci (®fe > x/K) and by union bound; (c) follows 
because uniformly random selection of T> and $ from the 
space of unitary matrices induces exchangeable distribu¬ 
tions p ( 01 ,..., 0 m) and p ( 01 ,..., 0 m) on their respective 
columns; and (d) follows by the law of total probability and by 
upper bounding probabilities by one. Of the remaining terms, 
we will focus only on and here, to avoid repetition. 
The uplink counterparts of these terms, given in ([^ and 0 . 
are bounded in exactly the same way in what follows, except 
where noted. 

First consider 0 . Note that the conditioning on g^ implies 
that k* ^ Sr for r ^ m, where k* is the strongest user in 
k* = argmaxfcgs^ |0^/ife|^. This ensures that the 


user that is scheduled for stream m is not already scheduled 
for another stream, and hence, using independent Gaussian 
codebooks and allocating equal power for each downlink 
stream, 

|2 ' 


R(™) > loe fl + ^ 

” - M 1 + (2M - l)e„ 


(7) 


almost surely. Therefore, 

A 

J ~mj gn 

1+ ^ maxi<fe<„ IlfifcP 


_J_'pDPC T}("i) ^ ^ 

-R„ -R„ > — 


(a) 

< P log 


1 + 


M 1-|-(2M-1)£„ 


> 


2M 


^m ; 


(b) 


< 


maxi<fc<^ 


maxfegs^ 

|2 


> 


1 + 


2M 


l + (2M-l)e„ 


J~ m 5 gn 


ma.xi<k<r. 


+ 


maxfcgs^ 


maxfcgs^ ll/ifeP maxfegs^ \4'*mhk\^ 

*7017 On 


> 2 \ 


1 + — 
^ 2M 


(d) 

< 


/ maxi<fc<„ llfifcll , 

V maxfegs^ II fife IP ^ 

( maxfegs^ ||fife "2 


1 + (2M- l)e„ 
*7071: On 


maxfcgs^ IPm^feT 


>1 + 7 


-P m: On 


( 8 ) 


(9) 


where (a) f ollow s by using Lemma 3 in d). for downlink 


E.l 


for uplink (replace ^ with P for uplink); 


and Lemma 

(b) follows by the inequality e® > 1 + a; and by the fact that 
I > for X > y, (c) follows by the fact ab > x ^ 
a + b > 2^fx (by AM-GM inequality); (d) follows by the fact 
that Y^^=i cik > X V^i (afc > xjK), by the union bound, 
and by defining 7 > 0 such that 

(1 + (2M - l)e„) (1 + 7 )^ < 1 + 
for sufficiently large n. 

Next, we bound the terms 0 and 0 separately. Consider 
(|^ first. 

^ maxi<fc<„ ||fife|p 



V maxfegs^ II fife IP 

maxfegso II fife 


>1 + 7 


*7 ^rri 7 On 


maxfegs^ 

maxfegsc 


>1 + 7 


7 On 


maxfegs^ IPm^fcl 


>1 + 7 


*7 ^jn 7 On 


( maxfeg5o 


maxfegs^ \(j)'tnhk\ 


>1 + 7 


pm I > nqn 


= YZY E E ^{Sm=As\\Sm\>nqr.) 

s—[nq^] PlsC[n]:|^s|—s 

/ maxfeg^pififclp 


maxfcg^^ PmTifcT 


>1+7 


I-4.S I > nqn^ Sm — A; 


































(c) 


1 


1-e; 


^ ^ = >«(?„) 
s=[ngTil •4sC[n]:|^3|=s 
maxfcg^c ||/ife||2 


maxfeg^^ 


>1+7 


l-^S I ^ ’^Qn ; Sjji — 7 


|3r ^ m : > £„ V 3j &T ■ |/ife* 


> Ei 




(d) 


1 


1-e: 


^ P(|5,„| = s||5„| >ng„) 

s=[n(/„] 

maxfcg^c ||/ifc||2 


maxfeg^^ 


>1+7 


|A|=^ 


|3r ^ m : 


(e) 

< 


1 


max 


1-e' 


max 


> e„ V 3j € T : \hk-j\ 
2 > 1 + 7 


> e. 


fcGyl' 


k^A, 


\(t>*mhk 


{3r^ 


(f) 

< 


m : > e 

1 / max 


fcGyl' 


V 3j G T : I 


> e. 




(l-En)^ \max^^g_4|(/)^/ifc|^ 


>1 + 7 


(g) 

< 


1 


P (uia.-^\(j)*^hk\^ < 21og ( ' 

Vfce+ Vlog(^9») 


(1-e'J 

max I 
k^A’^ 

' 2 log (ng„) + (2M + 2) log log (nq„) 


P ( max ||/ife||^ > 21og(ng„) + (2M + 2) loglog(ng„) 
\keA‘= 


2 log {nqn) - 2 log log (nq^) 


>1 + 7 


(h) 

< 




1 


O 


/ 1 


Vlog (nqn) 


+ 0 




logn 


where 


(b) is by the fact that for events B-, P(A|i?) < 
and by Lemma E.7 where e^ —> 0; 


subset of users such that >l = 

(f) is by Lemma [E77{ 

(g) is by the fact that for events A,B,C-, P(yl) < 
P (i?'') + P (C^) + P {A\B, C) by union bound and law 
of total probability; 


(h) is because of Lemmas E.5 and E .6 and by the fact 
that the last probability is that of the elements of a 
deterministic sequence converging to 1 being larger than 
1 + 7 for sufficiently large n; 

(i) is because we chose e„ = O ^nd thus 

O f logM-i n ) Lemma E.2 for the uplink and = 


by Lemma 


E.3 


^log 2 M-l 

Next, we move on to analyze the term (|^. 


maxfcgs^ 


for the downlink. 


@<1 

y maxfegs^ \(j)^hk\ 

(^) p / maxfcgg^ 

y maxfegs^ 
p / maxfcgg„. \(l>tnhk\ 
y maxfegs^ \(j)tnhk\ 

I /* 7 I 2 MCr 

max|<^„fifc| < — 


(c) 

< 


(d) 

< 


1-e: 

1 


1-ei 


1 — exp < — 





(a) follows by the fact that the ratio can be larger than 
(1 + 7 ) only if the maximum in the numerator occurs for 
a k G 5'° (otherwise the ratio is 1 ); 


^ (log 71)”/^°®”/ 

where 

• (a) follows by the fact that $ is unitary and thus ||‘h/ifc|| = 
\\hk\\; 

• (b) is by construction of the set Sm', 

• (c) is by Lemma IRTI 

• (d) is by Lemma ]e!^ 

• (e) is because we chose e^ = thus = 


O l log2A/-l 

by Lemma |E.2| 


by Lemma 


E.3 


and = O 


(c) is because Sm = As implies the newly conditioned 
event, which is that for any user outside the set As, there 
must exist an r such that \(j)*hk»\^ > e„ or an uplink 
user whose interference strength is larger than e„, by the 
construction of the set Sm, where we have defined k* := 
argmaxfcg_ 4 c |j/ifc|p (for the uplink case the second part 
of the event is removed); 

(d) follows by the fact that the probability on the right- 
hand side does not depend on As, as long as |>ls| is 
fixed, owing to the fact that the user channel vectors hk 
are i.i.d.; 

(e) follows because the given probability is a monoton- 
ically decreasing function of s, and A is any arbitrary 


Therefore (|^ goes to zero as n — > 00 . Next, we consider the 
terms in (|^. Note that the first term goes to zero since 


P(^1-P)= (l-exp{-|}) 

= 0 ^ ^ 


nqn 


^(logn)"/‘°®”; 

by Lemma [E(4| and by the choice of e„. The second term in 
goes to zero by weak law of large numbers for triangular 
arrays p^ , applied to the binomial random variable jiSml with 
mean np„. 

Since all terms in (|^, Q, Q, and (|^ go to zero, the result 
follows. 

















































Appendix B 
Proof of Theorem I3.2I 


Appendix C 
Proof of 0 


Let us choose e 
proof of Theorem 


„ = e > 0, i.e., a constant. Then, as in the 
3T 


P (R-n + R-n < 


M 


2M/3) < ^ P 

m—1 

+ p 


(r(-) 

(r(-) 


</ 3 ) 

</ 3 ) 


by the fact that < x ^ V^i <xlK) and 

by union bound. We only consider the first term, associated 
with downlink. The uplink term is bounded the same way, 
except where noted. By law of total probability, and by upper 
bounding probabilities by one. 


(rm </?) <pra+p(0^ij-™) 


+1 


( 


R 


(m) 


< 






( 10 ) 


Since e„ is a constant, P goes to zero exponentially by 
Hoeffding’s inequality. V {Q^\Tjn) is upper bounded by 

^l + 21oga 


by Lemmas E.4 and E.3 where a = (1 — exp {—e/2}). Note 


that the last term goes to zero exponentially if 1 + 2 log a > 0 , 
which is satisfied for sufficiently large e > 0. We consider the 
first term. Conditioned on a different user is scheduled 
for each stream, hence 


'(rW </3 


m 1 Qn 


- \ 1 + {2M-\)e j 

y 1 p ( maxfcgg^ 

<=' ' l + (2M-l)e 


< 

(b) 

< 


1 - e: 
1 


‘P' m ; Qn 


< 


Frr 


1 - e: 

1 


-P max|(/)//ifc| </?e(l + 2elogn) 

\ ‘^TTL 


J~ ri 


1-eL 


1 — exp < — 


/3e (1 + 2elogn) 


0 (e-^) 


where (a) follows by Lemma E.7 (b) follows by Lemma E.4 


and (c) follows, for some 0 < 7 < 1, by Lemma |E.3| with 
the choice e„ = e, and by letting M = a log n for sufficiently 
small a > 0. Since all terms in ( [TOl i go to zero exponentially 
as n —)■ 00 , 


P (R„ + R„ < 2M) < 00 

n 

and thus by Borel-Cantelli Lemma ||T 6 |, the result follows. 


Let us denote message of the fcth uplink user as Wk, the 
message intended for the fcth downlink user by Wk, and for 
any set S, define Ws = {Wk '■ k G S'}. We also define = 


^(m) _ where 1 is the vector of ones, i.e., vl'"’ is the 

vector of interference signals at the downlink users of cluster 
m at time t. Set Vt = ,... ,v. 

We consider a block length of N, and as explained in 


(m) 


(M) 


Section 


IV 


assume z) 


m) 


CAf{0, 11^), where 1 is the all 
ones vector, for m G [M]. We also assume that the downlink 
users within each cluster cooperate, since this cannot reduce 
capacity. Then, by Fano’s inequality. 


N (R„ + R„) < / (fL[„] ■,y^)+I (%]; y^) 

< I (VL[„] ■,y^)+I (%] ■,y^,y^, fL[„]) 

+ / (%] ■,y^,y^\ ) 

= h (y^) - h (y^ |1L[„]) + h {y^, y^ |1V[„]) 

- h {y^,y^ I^H’^h) 

= h{y^)+h{y^\ 1V[„] ,y^)-h{y^,y^\ fL[„], ^[„] ) 

N 

-h{yt,yt |VL[„],lV[„],y‘"\y*"^) 

N 

Yh {yMY +h{yt 

- h{yt,yt \ W[n],W[n],y*~'^,y*~^,xt,xt) 

- ^ ivt \yt,Xt) - h {zt,Zt) 

N 

= Y ^ ^ 1^*’ Xt)-h (zt) - h (zt) 

N 

^Y^ ^ ^y* M-h (zt) - h {zt) 

z ^ AT / M \ 

< ^ ) +k{yt\vt)-h (zt) 

\m^l / 

- eK'.'”’)) 

where (a) follows by independence of messages; (b) follows 

by the fact that xt is a deterministic function of 

and Xt is a deterministic function of W[n]', (c) follows because 

conditioning reduces entropy and by subtracting Xt and Xt 

from yt and yf, (d) is because vt = yt — ^H*x and by 

independence of uplink and downlink noise; (e) is by the fact 

that conditioning reduces entropy, and that noise processes at 

different clusters are independent. Since {hm} are orthogonal, 

[yi^^j can be uniquely expressed as yt = EXi 

i.e., this transformation is a bijection. Let us define the matrix 























H ■= 




. Then 


h {yt) = h = h {y[^\. .., + log 


H 


since H is unitary. Similarly, zt = X]m=i and 

are still distributed i.i.d. CM{0, 1). Hence, also using 
the fact that conditioning reduces entropy. 


:-(m) 


N M 


-(m) (m) 

Vt vl 


given by 


K — / + kjn.tP 


9 


[ h*m 9* 


Hyt\vt) = log27re \Ky\^\ , 


where is the conditional covariance matrix of y^^"’ given 
^(m) evaluating the differential entropy terms with 

Gaussian input distribution^ and using the fact that ^ 

CA/"(0,11^), we find •ID- 

APPENDIX D 

Proof of CorollaryI4.1I 


r] + 7] > M log 


1 


— M log I 1 + 


_ SNR'’ 
1+SNR° 

SNR“ 


1 + mSNR 


- M log 3 


g(SNR) 

^ ^SNR/’ I - log ( 1 


1 + 


SNR° 


1 + mSNR 


- log 3 


Lemma E.l: 

■dMAC-M 


Appendix E 
Auxiliary Lemmas 


{Pn) < M log [1 + P max ||hfc| 

l<fc<n 


N (R„ + R„) < 5] ^ h (y^l) + h (y-^1 ) 

t—1 m—1 

Let k^ t denote the number of uplink users scheduled in 
cluster m at time f, with X]m=i ^ni,t < Mi for all t. Note that 
given any power allocation, there is a covariance constraint on 


Proof: The capacity of a MIMO MAC with a per-user 
power constraint P, and an active user constraint M is given 
by 

(H„) = max log \Im + PHj,H*^\ 

AC[n]:|A|=M 


= max log 

AQ[n\-.\A\=M 


Im + P 'y ^ hkh*). 


keA 


Using the inequality |A| < (which is a direct 

consequence of AM-GM inequality applied to the eigenvalues 
of A), 


R' 


MAC-M 


(Rn) 


= max M log 

A<Z[n]-.\A\=M 


'tr(/M + PEfcG,4 ^kK) 


Hence, h{yt\vt) is maximized when {yt,vt) ~ CAf {0, K), 
with 


= max M log 1 + 

AQ[n]-.\A\=M \ 


M 

PEkGA^^i^kK) 


= max M log ( 1 + P 

A<Z[n\-\A\=M ’ 

= M log ( 1 + P max 


M 

r^Ek^Afhkr 


M 




AQ[n]-.\A\=M M 


Using Theorem 4.1 and (|^, p + p can be lower bounded by 

' l + SNR'^ \ 


< Mlog ( 1 + P max |jhfc| 

l<fc<n 


Lemma E.2: Lor an arbitrary uplink user 1 < fc < n, and 
arbitrary 1 < m < M, 

P (fc G Sm) = (1 - exp {-e„/2})““^ 


If we use the notation /(SNR) = y(SNR) to mean that 
limsNR->.oo = 1, then it is easy to see that 


1+SNR“ / 

= logSNR“-logSNR“-i 
= log SNR 

Hence, the result follows. 

^We evaluate assuming a joint Gaussian distribution on xt and 

xt with arbitrary coiTelation, since xt is a function of both Vf^[n] Vt-i- 


Proof: 

V{k&Sm)=f' < en, Vr ^ m 

[P(|^thfc|' <£„)]“”' 

(b) . , , .M -1 

= (1 - exp{-e„/2}) 

where (a) follows by the fact that the components of ^hk are 
i.i.d. distributed because $ is unitary; and (b) follows by the 
fact that is x^(2) distributed. ■ 

Lemma E.3: Lor an arbitrary downlink user 1 < fc < 
and arbitrary 1 < m < M, 

P (fc G Sm) = (1 - exp {-e„/2})^^“^ 

Proof: 

V{k&Sm) 

*rhk\^ < e„, Vr 7^ m\ \hkj\^ < e„, Vj G t) 





























Proof: 


= E P('r = ^) 

AQ[n\-.\A\^M 




r = yt 


•p(i</.: 

F{T = A) 

AQ[n]:\A\ = M 

■ P <enyrf= m; \hkjf < e„, Vj e 

= P (l^r^fcl^ <£n'^rf= m; \hkj\^ < e„, Vj G Vl) 

[f (\f*hk\^ < en)Y'~' [V {\hki\^ < 

= (1 - exp{-e„/2})^^^“^ 

where (a) follows by the fact that T is a function of 
{fm^k}^ and all links are independent, and thus the event 
{r = A} is independent, (defining A to be an arbitrary 


subset of uplink users s.t. 


A 


max < X 

keSm 


l-S'ml > < (l - e 2 )”'^" 


Proof: 


P 1 Wmhk\ < X 


I Srti I ^ kiqji 


= E E A||5„| >n9„) 

s=[n(p—5)] ^sC[n]: 

• P [ max < X 

\ k^As 


I vis I V TlQn^ Sjji — A^ 


(a) 


= Y P(l>5'm| = S IIS'™! > ng„) 

= [ng„] 


(max < X 


(b) / „ 

< P max If^hkl < X 
\kGAs 

(J 


|Vls| = S 

\As\ = nqn 


max Xi < 2 log N — log log N 

l<i<N 

N 


= [P {Xi < 2 log TV - log log TV)] 

= (1 - exp {- log TV + log log N}f =(^1- j 

= exp|TVlog (^1- 


N 


= exp 




TV 


iV2 


1 

< — 
- TV 


Lemma E.6: Let Xi,... ^Xm be i.i.d. x^(2TlV) distributed 
random variables. Then for TV sufficiently large. 


max Xi> 2 log TV + (2M + 2) log log TV 

l<i<Af 


= TVf); (b) follows because 


the components of ^hk are i.i.d. distributed and all links are 
independent; and (c) follows because both \(j)\hk^ and \hki^ 
are x^( 2 ) distributed. ■ 

Lemma E.4: 


= 0 


1 


Jog TV ^ 

Proof: Chernoff bound for a x^(2TVf) random variable Z 
is given by 


/ 'V \ 

for X > 2M. Then, assuming TV is large enough. 


max Xi> 2 log TV + (2M + 2) log log TV 

l<i<Af 


= 1 — P ^ max Xi <2 log TV + {2M + 2) log log TV 

= 1 - [P(Xi < 21ogA^+ (2TW + 2)loglogTV)]^ 

= 1- [l-P(Xi > 21ogTV + (2TW + 2)loglogTV)]^ 
A /21ogTV + (2TVf + 2) loglogTV 


2M 


exp 


^ 21ogA^ + (2TlV + 2) log log 


2M 


M' 


= 1 - 1 - 


(2 log TV + (2TVf + 2) log log ' 


N 


N 


{2M)’^ N\og^+^ N 
(2 log N + (2M + 2) log log N) 


= 1 — exp < — 


M 


(2TVf/e)“ log^+^ TV 


where (a) follows by the fact that the probability on the right- 
hand side does not depend on Vis as long as jVlsj is fixed, 
owing to the fact that the user channel vectors hk are i.i.d., 
and since $/ifc ~ CA/'(0,/); (b) follows because the given 
probability is a monotonically decreasing function of s; and 
(c) is because are i.i.d. x^( 2 ) distributed; ■ 

Lemma E.5: Let Xi,...,Xm be i.i.d. x^(2) distributed 
random variables. Then 


max Xi <2 log TV — 2 log log TV < — 
l<i<N ^ ^ ^ j - N 


(ji) / g \ M (21ogTV +(2T\T-|-2)loglogTV) 
- \2MJ 


M 


log^+^ TV 

mO (log'^iv) 1 


= ( — 1 ^ ^ Q 

\2M) \og^^+^N V log TV 

where (a) is by the inequality 1 — x < e~^. ■ 

Lemma E.7: If TV„ —00 and e„ —>■ 0 as n — 00 , then for 
i.i.d. x^( 2 ) distributed Xi,..., Xjq^, 

lim P 1 max Xu > e„ | =1. 
n—^oo \l<k<Nn J 


The proof for Lemma E.7 is trivial and omitted here. 



























